A new transpose split method for three-dimensional FFTs: performance on an Origin2000 and Alphaserver cluster

نویسندگان

  • P. Wapperom
  • A. N. Beris
  • M. A. Straka
چکیده

We discuss a new transpose split method for parallel computations of three-dimensional Fourier transforms. By splitting the data along two dimensions over the processors it allows for a higher degree of parallelization than the original transpose split method. The traditional transpose split method involves one alltoall communication. The new method involves two communication steps in which each processor performs an ”alltoall” communication in groups. The performance of the new method has been evaluated using MPI on an Origin2000 and an Alphaserver cluster and compared with the traditional transpose split method. We found that the extra communication step introduced in the new method only slightly increases communication time. However, an efficient parallelization depends critically on how fast the communications can be performed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Benchmarks of Turbulencein Complex Geometries ?

In this paper we present benchmark results from the parallel implementation of the three{dimensional Navier{Stokes solver Prism on diierent parallel platforms of current interest: IBM SP2 (all three types of processors), SGI Power Challenge XL and Cray C90. The numerical method is based on mixed spectral element{ Fourier expansions in (x ? y) and z?directions, respectively. Each (or a group) of...

متن کامل

FFTS ON THE VPP 500 3 Platform FFT performance rFFT

Fast Fourier transforms parallelize well but need large amounts of communication. An algorithm which concentrates all the communication in one or two transposition steps is the transpose split algorithm. Di erent transposition algorithms can be used depending on data size and communication latency. A new transpose split algorithm for real and hermitian data is presented for one, two and three d...

متن کامل

Multidimensional, Multiprocessor, Out-of-Core FFTs with Distributed Memory and Parallel Disks

We show how to compute multidimensional Fast Fourier Transforms (FFTs) on a multiprocessor system with distributed memory when problem sizes are so large that the data do not fit in the memory of the entire system. Instead, data reside on a parallel disk system and are brought into memory in sections. We use the Parallel Disk Model for implementation and analysis. Our method is a straightforwar...

متن کامل

A decomposition method with minimum communication amount for parallelization of multi-dimensional FFTs

The fast Fourier transform (FFT) is undoubtedly an essential primitive that has been applied in various fields of science and engineering. In this paper, we present a decomposition method for parallelization of multi-dimensional FFTs with smallest communication amount for all ranges of the number of processes compared to previously proposed methods. This is achieved by two distinguishing featur...

متن کامل

Multi-Dimensional Modeling of the Effects of Spilt Injection Scheme on Performance and Emissions of IDI Diesel Engines

One of the important problems in reducing of pollutant emission from DI and IDI diesel engines is trade-off  between soot and NOx. Split injection is one of the most powerful tools that makes the chance to shift the trade-off curve closer to origin.  At the present work, the effect of split injection on the combustion process and emissions of a cylinder IDI diesel engine under the specification...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Parallel Computing

دوره 32  شماره 

صفحات  -

تاریخ انتشار 2006